Learning Wake-Sleep Recurrent Attention Models

نویسندگان

  • Jimmy Ba
  • Ruslan Salakhutdinov
  • Roger B. Grosse
  • Brendan J. Frey
چکیده

Despite their success, convolutional neural networks are computationally expensive because they must examine all image locations. Stochastic attention-based models have been shown to improve computational efficiency at test time, but they remain difficult to train because of intractable posterior inference and high variance in the stochastic gradient estimates. Borrowing techniques from the literature on training deep generative models, we present the Wake-Sleep Recurrent Attention Model, a method for training stochastic attention networks which improves posterior inference and which reduces the variability in the stochastic gradients. We show that our method can greatly speed up the training time for stochastic attention networks in the domains of image classification and caption generation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی شیوع اختلالات خواب و اختلالات یادگیری عصب روان‌شناختی در کودکان پیش از دبستان

Introduction: The prevalence of sleep disorders is different in international studies. Sleep disorders with the increasing prevalence among children is common. Cognitive problems are the most serious complication of sleep disorders in children. The present study, the prevalence of sleep problems and neuropsychological learning disabilities were evaluated on pre-school children (4-6 years old) i...

متن کامل

A wake-sleep algorithm for recurrent, spiking neural networks

We investigate a recently proposed model for cortical computation which performs relational inference. It consists of several interconnected, structurally equivalent populations of leaky integrate-and-fire (LIF) neurons, which are trained in a selforganized fashion with spike-timing dependent plasticity (STDP). Despite its robust learning dynamics, the model is susceptible to a problem typical ...

متن کامل

Comorbidity of Non-24-hour Sleep-wake Syndrome and Seasonal Affective Disorder in a Young Man: a Case Report

Objective: Few clinical reports have described in detail the comorbidity of seasonal affective disorder (SAD) and non-24-hour sleep-wake syndrome (non-24-SW). Both SAD and non-24-SW are thought to be caused by the interplay between internal clock dysfunction and insufficient external time cues. The aim of this study is to present and discuss in detail a subtype of psychiatric comorbidity and it...

متن کامل

Factor Analysis Using Delta-Rule Wake-Sleep Learning

We describe a linear network that models correlations between real-valued visible variables using one or more real-valued hidden variables-a factor analysis model. This model can be seen as a linear version of the Helmholtz machine, and its parameters can be learned using the wake-sleep method, in which learning of the primary generative model is assisted by a recognition model, whose role is t...

متن کامل

Convergence of the Wake-Sleep Algorithm

The W-S (Wake-Sleep) algorithm is a simple learning rule for the models with hidden variables. It is shown that this algorithm can be applied to a factor analysis model which is a linear version of the Helmholtz machine. But even for a factor analysis model, the general convergence is not proved theoretically. In this article, we describe the geometrical understanding of the W-S algorithm in co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015